# COCO Dataset

YOLOv10x is the largest variant of YOLOv10, a YOLO-series model focused on real-time end-to-end object detection, described as offering higher detection accuracy and faster inference than earlier YOLO versions.

Object Detection

YOLOv10 is a real-time end-to-end object detection model developed by a Tsinghua University team as an improved version of the YOLO series.

Object Detection

YOLOv10 is a real-time end-to-end object detection model developed by the Tsinghua University team, representing the latest improvement in the YOLO series.

Object Detection

YOLOv10 is a real-time end-to-end object detection model proposed by Tsinghua University, known for its efficiency and accuracy.

Object Detection

YOLOv10 is a real-time object detection model that eliminates post-processing steps such as Non-Maximum Suppression (NMS), removing that overhead from inference.

Object Detection
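For context on the post-processing step that the entry above says YOLOv10 removes, here is a minimal sketch of greedy Non-Maximum Suppression in plain Python. It is an illustration of the general technique, not code from YOLOv10 or any particular library; the function names `iou` and `nms`, the `(x1, y1, x2, y2)` box format, and the 0.5 threshold are all assumptions chosen for the example.

```python
def iou(a, b):
    """Intersection over union of two boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def nms(boxes, scores, iou_threshold=0.5):
    """Greedy NMS: return indices of kept boxes, highest score first.

    The threshold value is an illustrative assumption, not a YOLO default.
    """
    # Visit candidates from most to least confident.
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    for i in order:
        # Keep a box only if it does not overlap too much with any kept box.
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep


boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
scores = [0.9, 0.8, 0.7]
kept = nms(boxes, scores)
```

In this example the second box overlaps the first heavily and is suppressed, while the third box is far away and survives. A detector that needs this pass does extra work after the network has run, which is the overhead an NMS-free design avoids.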

Mask2former Swin Tiny Coco Panoptic

Mask2Former is a Transformer-based unified image segmentation model that supports instance, semantic, and panoptic segmentation tasks, using a masked attention mechanism to improve performance.

Image Segmentation

Mask2former Swin Small Coco Panoptic

A small version of Mask2Former based on the Swin backbone network, optimized for panoptic segmentation tasks on the COCO dataset.

Image Segmentation

Mask2former Swin Large Coco Panoptic

A large version of Mask2Former based on the Swin backbone network, trained specifically for panoptic segmentation tasks on the COCO dataset.

Image Segmentation

Mask2former Swin Base Coco Panoptic

A Mask2Former model based on the Swin backbone network and trained on the COCO panoptic segmentation dataset. It adopts a unified paradigm to handle instance, semantic, and panoptic segmentation tasks.

Image Segmentation
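To illustrate how the three segmentation tasks named above relate to each other, the sketch below derives a semantic map and a set of instances from one panoptic result. It is a simplified assumption about the data layout (a 2D grid of segment ids plus per-segment metadata), not the output format of Mask2Former or any specific library; `split_panoptic`, `label`, and `is_thing` are hypothetical names.

```python
def split_panoptic(segment_map, segments):
    """Derive semantic and instance views from a panoptic result.

    segment_map: 2D list of segment ids, one per pixel.
    segments: dict mapping segment id -> {"label": str, "is_thing": bool}.
    Both structures are illustrative assumptions.
    """
    # Semantic view: every pixel gets its class label, instances merged.
    semantic = [[segments[s]["label"] for s in row] for row in segment_map]

    # Instance view: pixel coordinates grouped per countable ("thing") segment.
    instances = {}
    for y, row in enumerate(segment_map):
        for x, s in enumerate(row):
            if segments[s]["is_thing"]:
                instances.setdefault(s, []).append((y, x))
    return semantic, instances


segment_map = [
    [1, 1, 2],
    [3, 3, 2],
]
segments = {
    1: {"label": "person", "is_thing": True},
    2: {"label": "person", "is_thing": True},
    3: {"label": "grass", "is_thing": False},
}
semantic, instances = split_panoptic(segment_map, segments)
```

Here the two people share one label in the semantic view but stay separate in the instance view, while the grass region appears only in the semantic view. A panoptic result carries both kinds of information at once, which is why a single model can serve all three tasks.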

Mask2former Swin Large Coco Instance

Mask2Former is a Transformer-based unified image segmentation model, utilizing a Swin-Large backbone and fine-tuned on the COCO dataset, specializing in instance segmentation tasks.

Image Segmentation

Oneformer Coco Dinat Large

A single unified Transformer architecture for image segmentation, supporting three tasks: semantic, instance, and panoptic segmentation.

Image Segmentation

Yolos Small 300

A small YOLOS model fine-tuned on the COCO 2017 object detection dataset, using a Vision Transformer architecture for efficient object detection.

Object Detection

Yolos Small Dwr

A YOLOS model fine-tuned on the COCO 2017 object detection dataset, utilizing a Vision Transformer architecture, suitable for object detection tasks.

Object Detection

A Vision Transformer (ViT)-based object detection model trained with the DETR loss function and reported to perform well on the COCO dataset.

Object Detection

A YOLOS model fine-tuned on the COCO 2017 object detection dataset, using a Vision Transformer architecture for efficient object detection.

Object Detection

Detr Resnet 50 Panoptic

DETR is an end-to-end object detection model based on the Transformer architecture. This version uses ResNet-50 as the backbone network, is trained on the COCO dataset, and supports object detection and panoptic segmentation tasks.

Image Segmentation

Maskformer Swin Large Coco

A large MaskFormer model based on the Swin backbone network, unifying instance, semantic, and panoptic segmentation tasks.

Image Segmentation

Maskformer Swin Small Coco

A small MaskFormer model based on the Swin backbone network, trained on the COCO dataset for panoptic segmentation tasks.

Image Segmentation

Maskformer Swin Tiny Coco

A panoptic segmentation model trained on the COCO dataset, using a unified paradigm to handle instance, semantic, and panoptic segmentation tasks.

Image Segmentation

Maskformer Swin Base Coco

A panoptic segmentation model based on the Swin backbone network, trained on the COCO dataset, unifying instance, semantic, and panoptic segmentation tasks.

Image Segmentation
